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I 

Local gauge freedom in relativistic quantum mechanics is derived from a measure- 
| ment principle for space and time. For the Dirac equation, one obtains local {7(2, 2) 

■ gauge transformations acting on the spinor index of the wave functions. This local 

?7(2, 2) symmetry allows a unified description of electrodynamics and general relativity 
^ ■ as a classical gauge theory. 

cn ; 

1 Connection between Local Gauge Freedom and Position 
Measurements 

; 

Q\ | In [3], it was suggested to link the physical gauge principle with quantum mechanical 

measurements of the position variable. In the present paper, we will extend this concept 
to relativistic quantum mechanics and apply it to the Dirac equation. For Dirac spinors, 
Qh! we obtain local {7(2, 2) gauge freedom. Our main result is that this {7(2, 2) symmetry 

allows a natural description of both electrodynamics and general relativity as a classical 
gauge theory. This is shown by deriving a {7(2, 2) spin connection from the Dirac operator 
and analyzing the geometry of this connection. Although we develop the subject from a 
particular point of view, this paper can be used as an introduction to the Dirac theory in 
curved space-time. 

In contrast to [3], where the point of interest is the measure theoretic derivation of local 
gauge transformations, we will here concentrate on the differential geometry of the Dirac 
operator. In order to keep measure theory out of this paper, we will use a bra/ket notation 
in position space. This allows us to explain the basic ideas and results of [3] in a simple, 
non-technical way which will be sufficient for the purpose of this paper. Nevertheless one 
should keep in mind that the bra/ket symbols and the (5-normalizations are only a formal 
notation; the mathematical justification for this formalism is given in [3]. 

We begin with the example of a scalar particle in nonrelativistic quantum mechanics. 
The particle is described by a wave function ^(x), which is a vector of the Hilbert space 
H = L 2 (IR 3 ). The physical observables correspond to self-adjoint operators on H. The 
position operators X , for example, are given as multiplication operators with the coordi- 
nate functions, X 1 : ^f(x) — > x l ^?(x), i = 1, ... ,3. Our definition of the Hilbert space 
as a space of functions in the position variable was only a matter of convenience; e.g. we 
could just as well have introduced H as functions in momentum space. Therefore it seems 
reasonable to forget about the fact that H is a function space and consider it merely as 
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an abstract Hilbert space. After this generalization, which is often implicitly assumed in 
quantum mechanics, we must construct the representation of H as wave functions. For 
this purpose, we choose an "eigenvector basis" \x> of the position operators, 

X 1 \x> = x l \x> , <x\y> = 5 3 (x-y) , (1) 

and define the wave function of a vector ^ G H by ty{x) = <x\ \t>. 

This "position representation" of the Hilbert space is quite elementary and is currently 
used in physics. We point out that it is not unique, because the "eigenvectors" \x> are 
only determined up to a phase. Namely, we can transform |x> according to 

| f> ^ e ieA(x) | f> ( 2 ) 

with a real function A(x). This corresponds to a local phase transformation 

-» e~ ieA W -$(x) (3) 

of the wave functions. The arbitrariness of the local phase of the wave functions can also 
be understood directly from the fact that the wave function itself is not observable, only its 
absolute square \^\ 2 has a physical interpretation as probability density. It is important for 
the following that the local phase transformations ([21) , (EI) can be interpreted as U (1) gauge 
transformations. To see this, we consider the canonical momentum operator tt = —iS/ — eA 
with magnetic vector potential A. Under the transformation ([2]), the canonical momentum 
behaves like 

- ^ e -ieA{S) j e ieA(x) = _ ^ _ ^ 

which corresponds to the gauge transformation A — > A — VA of the vector potential. 

In this way, we have explained the local U(l) gauge symmetry of the magnetic field 
from the fact that space is a quantum mechanical observable. Unfortunately, this natural 
explanation of local U(l) gauge freedom only works for scalar particles. In the general 
case with spin, the wave functions have several components, \P £ H = L 2 (IR 3 ) m (i.e. 
m = 2s + 1 for particles with spin s). We again consider H as an abstract Hilbert space. 
In order to construct the representation of a vector $ 6 F as a function, we choose an 
"orthonormal basis" \xa>, x G IR 3 , a = 1, . . . ,m of the position operators, 

X 1 \xa> = x l \xa> , <xa | y(3> = 8 a p 5 3 (x — y) , (4) 

and define the wave function by ^f a (x) = <xa \^f>. The basis \xa> is only unique up to 
local unitary transformations, 

m 

\xa> -> ^(t/- 1 )^) \x/3> with U(x) € U(m) , (5) 

13=1 

as is verified by substituting into dH). Thus the wave functions can be transformed ac- 
cording to 

-» U(x)^(x) . (6) 

It would be nice if we could again identify the local U(m) transformations ((SJ), ([6]) 
with physical gauge transformations. Then the local gauge principle would no longer be 
an a-priori principle in physics. It would be a consequence of a quantum mechanical 
"measurement principle", namely the description of space with observables X 1 on an 
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abstract Hilbert space. Notice that the local U(m) transformations (|6]) are a generalization 
of the phase transformations ([3]). Our idea is that the additional degrees of freedom of 
the larger gauge group in ([6]) might make it possible to describe additional interactions 
(like gravitation or the weak and strong forces). Notice that, as a great advantage of 
our procedure, the local gauge group could no longer be chosen arbitrarily; it would be 
determined by the configuration of the spinors. Thus we could hope to get into the position 
to explain the gauge groups observed in physics. At the moment, however, it is not clear if 
the interpretation of ©,([6]) as gauge transformations really makes physical sense. We will 
in the following simply assume it as a postulate and want to study its further consequences. 

In view of our later generalization to curved space (and curved space-time), it is con- 
venient to introduce the spectral measure dEg of the position operators: We form the 
"projectors" Eg on the "eigenspaces" of X, 

m 

Eg = \xa><xa\ . (7) 

a=l 

The Eg do not depend on the choice of \xa>; we can characterize them by the operator 
relations 

X 1 E s = x l Eg , Eg E$ = 5 3 (x- y) Eg . (8) 

The ^-normalization in ([8]) leads to difficulties as soon as curved coordinate systems are 
considered. To avoid these problems, it is useful to combine Eg with the integration 
measure by introducing the spectral measure dEg := Eg dx. Integrating over the spectral 
measure yields an operator on the Hilbert space. For example, we have 

X 1 = f x l dEg , xv = [ dE x , (9) 

where xv is the multiplication operator with the characteristic function (i.e. (xv^)( x ) is 
equal to *$>(x) if x £ V and vanishes otherwise). These relations can be verified directly 
with the help of (|4|) and (jTj) . Actually, the spectral measure is completely characterized 
by ([9]). In the mathematical paper [3], spectral measures on a manifold are used as the 
starting point. In this more general approach, the number of components of the wave 
functions m = m(x) may vary in space. We call it the spin dimension. 

2 Generalization to Dirac Spinors, [7(2,2) Gauge Symmetry 

If our explanation of local gauge freedom shall have general significance, it must be possible 
to extend it to the relativistic setting. We start the analysis with Dirac wave functions in 
Minkowski spac^]. Since we do not yet want to introduce Dirac matrices and the Dirac 
operator, we consider the wave functions just as 4-component functions. We 

define a scalar product on the spinors, 

4 

= ^s Q (r(x))*$ a (x) with g 1 = s 2 = l J S3 = S4 = -1 , 
o=l 

(10) 

1 We mention for completeness that Minkowski space is a four-dimensional real vector space with a 

scalar product of signature (H ). A coordinate system where this scalar product coincides with the 

Minkowski metric rjij is called reference frame. We will mainly work in a fixed reference frame and can 
then identify Minkowski space with IR 4 . 
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which is called spin scalar product. It is indefinite of signature (2,2). Later, after the 
introduction of the Dirac matrices in Dirac representation, the spin scalar product will 
coincide with ^ / <I>, where ^ = ty*^ is the adjoint spinor. Our definition without referring 
to Dirac matrices will clarify the relation between coordinate and gauge transformations 
in section El 

The basic question is how we want to describe space-time. At the moment, position 
and time are merely parameters of the wave functions. This is the usual description in 
relativistic quantum mechanics. In the previous section, however, it was essential for the 
explanation of local gauge transformations that space corresponds to some operators on 
the wave functions. In order to generalize this "measurement principle" to the relativistic 
setting, we introduce "observables" for space and time as multiplication operators with 
the coordinate functions, 

X i : *(a?) -► x^ix) , i = 0,...,3 . (11) 

We point out that these operators, especially the time operator X°, are commonly not 
used in relativistic quantum mechanics. Nevertheless our description of Minkowski space 
makes sense, as we will see in the following. 

The next step is the introduction of a scalar product on the wave functions. Usually 
one considers the positive product 

(tf|$)= / ^*(t,x)$(t,x) dx , (12) 
Jm 3 

where the spinors are integrated over a space-like hypersurface t = const (we avoid the 
notation <I/*<I> = <J/^°<I> with the adjoint spinor because we have no Dirac matrices at the 
moment). The integrand of (\& | \I/) can be interpreted as the probability density of the 
particle. For solutions of the Dirac equation, current conservation implies that (|12p is 
independent of t. As an apparent problem of this scalar product, time measurements do 
not make sense, because expectations (^ | X° | \&) = t (\P | \P) depend on the choice of 
the hypersurface. The only way out is to introduce a different scalar product where the 
spinors are also integrated over the time variable, namely 

<^|$>= f^m\<5>y~d i x . (13) 

J IR. 

In contrast to (|12p . the scalar product (|13p has no immediate physical interpretation. 
Nevertheless it causes no problem to consider (113p as the fundamental scalar product on 
the spinors, for the following reason: The scalar product does not enter in the physical 
equations (i.e. the Dirac equation and the classical field equations), we only need it for 
the probabilistic interpretation of the wave functions. For this final interpretation, we can 
just introduce (|12p by an additional mathematical construction (see equation (|27p ). 

We choose (fT3j) as the fundamental scalar product on the wave functions. We de- 
note the corresponding function space by (H,<.\.>) and consider it together with the 
position/time operators (fTTj) as an abstract scalar product space. In order to represent 
the vectors of H as wave functions, we choose an "eigenvector basis" \xa>, x G IR 4 , 
a = 1, . . . , 4 of the position/time operators, 

X % \xa> = x l \xa> , <xa \ y(3> = s a 5 a p 5 A (x — y) , (14) 

and define for G H the corresponding wave function by *$> a (x) = <xa \ This is 

a relativistic analogue of (j4j). The additional factors s a = ±1 occur as a consequence of 
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the indefinite spin scalar product. The arbitrariness of the choice of \xa> leads to the 
transformations 

4 

\xa> -> \x(5> 
(3=1 

V(x) -> with C/(x) G [7(2, 2) , (15) 

which we interpret as local U(2, 2) gauge transformations. According to this interpretation, 
we call \xa> a gauge. Notice that the gauge group is non-compact, which is directly related 
to the indefiniteness of the spin scalar product. Similar to the nonrelativistic case, the 
spectral measure is given by 

4 

dE x = 2_, s /3 \xf3><x(3\ d x 

13=1 

We say that the spin dimension is (2, 2). 

We remark that the indefiniteness of the scalar product < . | . > leads to mathematical 
problems in the rigorous derivation of gauge transformations. A first, but not fully con- 
vincing attempt towards a satisfying mathematical formulation was made in [2] (which 
also contains a preliminary version of [3]). 



3 The Free Dirac Operator, Lorentzian Transformations 

Having introduced the Dirac wave functions in Minkowski space, we can now define the 
free Dirac operator: We fix the reference frame, choose a special gauge, and introduce the 
partial differential operator ify := irfdj on the wave functions, where 7 J denote the usual 
(4 x 4)-matrices in the Dirac representation acting on the spinor index, 

(a 1 , a 2 , a 5 are the Pauli matrices). Now we forget about the fact that the free Dirac 
operator was defined in a special gauge and a special reference frame and consider it as 
an operator on H. We denote this operator by G and write the free Dirac equation in the 
coordinate and gauge invariant form (G — m)^ = 0. 

The Dirac operator is Hermitian (with respect to the scalar product <.|.>). This is 
most conveniently verified in the original gauge and reference frame where G = iij!): For 
simplicity, we rewrite the factors s a in (|10p with the matrix 7 and obtain 

|$> = / -<GV I $y ]x d A x = f (i^d^Mx)* 7 $(s) d A x 

= -i [ (dj*)* ( 7 J ')*7°$d 4 x = i [ ^*7 (7°(7 i )*7°)^^ 

JIR 4 7lR 4 



[ 7 o = f ^ 1 G^\ x d A x = <f I G$> 



Note that the 7-matrices are not Hermitian (more precisely, (7 )* = 7 and (t 3 ')* = 
— 7 J for j = 1,2,3), but they are self-adjoint with respect to the spin scalar product, 
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We point out that G is as operator on H a coordinate and gauge independent math- 
ematical object (although its definition might depend on the special gauge and reference 
frame chosen at the beginning). Both coordinate and gauge transformations are merely 
passive transformations and lead to equivalent representations of G. Especially, transfor- 
mations of the space-time coordinates and of the spinors are independent of each other. 
This is a major difference to the usual description of the Dirac equation. Therefore we 
will now discuss coordinate and gauge transformations and explain what "Lorentzian in- 
variance" of the free Dirac operator exactly means in our setting. 

We first look at the Dirac operator in a general gauge, but still in the original reference 
frame. The gauge transformation (|15f) leads to 

8 

iB — > G = Ui&U- 1 = iG 3 — + B with 

Gi(x) = U(x)j j U(x)~ 1 , B(x) = iU(x)~f j (djUix)- 1 ) 

As explained before, all these operators are equivalent; of course it is most convenient to 
work in the original gauge with G = i&. 

Next we look at a Lorentzian transformation. The transformation of the space-time 
variables is described by a coordinate transformation x 3 — > h 3 k x k , where A is an isometry 
of Minkowski space. The Dirac operator transforms like 

iB — > G = i^k) d k . (16) 

Thus in the new reference frame, the Dirac operator no longer has the original form. 
This might seem to contradict Lorentzian invariance, but we must keep in mind that 
all operators related to each other by gauge transformations are equivalent. Thus we 
must look for a gauge transformation such that the Dirac operator in the new reference 
frame again coincides with id. As is shown in standard textbooks on relativistic quantum 
mechanics (see e.g. pQ), there is a (4 x 4)-matrix U(A) with 

U(A) y'Aj UiA)" 1 = 7 fc 

It is important for our purpose that this matrix is unitary with respect to the spin scalar 
product, U(A) E U(2,2). Therefore we can perform a gauge transformation with U(x) = 
U(A). In the new gauge we again have G = i3, which shows Lorentzian invariance of G. 
To summarize, we describe a Lorentzian transformation in two steps. The first step is a 
coordinate, the second a gauge transformation. If both steps are performed at once, the 
transformation of the space-time coordinates and of the spinors are related in the usual 
way. Notice, however, that the second step is only a matter of convenience; we could 
just as well work in the new reference frame in any other gauge. This splitting of the 
space-time and spinor transformation becomes possible because the conditions G 3 = 7 J 
give a link between coordinate and gauge transformations. 

We remark that it is not trivial that a spinorial equation can be described with pas- 
sive coordinate and gauge transformations. As a counterexample, we consider the Weyl 
equation: Chiral fermions are usually described by two-component spinors \ satisfying 
the equation 

*^X = (17) 

with Pauli matrices cr° = 1, a. This equation is Lorentzian invariant in the sense that 
there is a one-to-one correspondence between the solutions of (|17p in different reference 
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frames (see e.g. [I]). We assume that the Weyl operator W := ia J dj were given as a 
coordinate and gauge independent object (we will not specify the scalar product on the 
wave functions). After a Lorentzian transformation x- 7 — > A 3 ,x , the Weyl operator has 
the form W = ia k Ai dj. If W was Lorentzian invariant, it should be possible to perform 
a gauge transformation such that W = ia 3 dj. Thus there should be a matrix J7(A) with 

U(A)a k A{U(A)- 1 = a j 

Taking the trace on both sides, however, yields the equation Ag = <5g, which is only satisfied 
for A = 1. We conclude that the Weyl operator cannot be described as a Lorentz invariant 
operator in our setting (this is not a serious problem because the Weyl equation can always 
be obtained as the left- or right-handed component of the massless Dirac equation). 

4 General Definition of the Dirac Operator 

In the previous section, we introduced the free Dirac operator as an operator on the 
wave functions in Minkowski space. We point out that this definition involved some 
global assumptions on space-time. First of all, the reference frames gave global coordinate 
systems. Furthermore, we chose a special gauge at the beginning and described Lorentzian 
transformations using global gauge transformations (i.e. transformations independent of 
x). According to the principle of general relativity, however, global conditions do not make 
sense. In this section, we replace them by corresponding local conditions, which will yield 
the general definition of the Dirac operator. 

We start with the observation that the Minkowski metric can be derived from the Dirac 
matrices by rfi 1 = ^ {G l , This allows us to forget about the Minkowski metric and 
view Minkowski space simply as a four-dimensional vector space. For the generaliza- 
tion to curved coordinate systems, we replace this vector space by a four-dimensional 
(smooth) manifold M. Notice that M has no Lorentzian structure; we will later recover 
the Lorentzian metric from the Dirac operator. 

Next we must generalize the scalar product space (H,<.\.>) and the position/time 
operators to the setting of a space-time manifold. This can be most easily done with the 
help of spectral measures: We consider an indefinite scalar product space (H, <.|.>) and 
introduce a spectral measure {dE x ) x ^M on M of spin dimension (2,2). For any chart 
(x % , U), we define corresponding position/time operators by 

X* = ( x l dE x . (18) 

Ju 

We choose an "eigenvector basis" \xa>, x G U, a = 1, . . . , 4 of the X 1 , 

X % \xa> = x l \xa> , <xa \ y(3> = s a 5 a! 3 5 4 {x — y) , (19) 

and represent a vector ^ E H as the wave function ty a (x) = <xa \^>, x £ U. This 
construction generalizes (|9l). ()14p to curved coordinate systems. The spectral measure in 
([18p allows us to extend the usual changes of charts to the level of a calculus of operators. 

After these preparations, we can introduce the Dirac operator G. First of all, it shall 
be a Hermitian operator on H. In a representation of H as wave functions, we assume it 
to be a partial differential operator of first order, i.e. 

G = iGp£ T + B (20) 
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with (4 x 4)-matrices G 3 (x),B(x). Since G is (as an operator on H) a coordinate and 
gauge independent object, the matrices G 3 (x) and B{x) have a well-defined behavior 
under coordinate and gauge transformations. As additional condition, we demand that 
the matrices G 3 shall coincide locally with the usual Dirac matrices: 

Def. 4.1 A partial differential operator G on H is called Dirac operator, if for any 

p £ M there is a chart (x l , U) around p and a gauge \xa> such that G has the form Ii20\) 



This definition can also be understood in more physical terms: Let us assume that an 
observer (e.g. a spacecraft passing through the space-time point p) chooses a coordinate 
system and gauge satisfying (|2ip . Since the Dirac matrices at p coincide with those of the 
free Dirac operator, the observer at p has the impression that space-time locally resembles 
Minkowski space. Thus we expect that the Dirac operator gives a Lorentz metric g^, 
which coincides with the Minkowski metric at p, gjk(p) = Vjk- We can really derive this 
Lorentz metric from G; it is most conveniently given by 



In this way, our definition of the Dirac operator incorporates the principle of general 
relativity; the coordinate and gauge satisfying (|21h gives a local reference frame. If the 
partial derivatives djgi~i(p) of the metric also vanish (which can be arranged by choosing a 
so-called "normal coordinate system" ) , the reference frame is a "local system of inertia" , 
where the observer feels no gravitational force at p. It might happen that djG k (p) ^ 
although djgkiip) vanishes. This corresponds to a "spin-mixing force" which is related 
to our £7(2,2) symmetry. The matrix B{x) can be used for the description of additional 
interactions, especially of electromagnetism G = i(j) + eA*. Note that B{x) was not at all 
specified in our definition of the Dirac operator. This will turn out to be too general for 
physical applications. We postpone the discussion and specification of B(x) to section [6l 
Unfortunately, the equation (|19h for the "eigenvector basis" is too simple and must 
be modified. This is only a minor technical point, but nevertheless we give it in detail: 
The <5 4 -distribution in <\19h is useful in combination with the integration measure d 4 x, 
because integrations can be easily carried out using the relation J f{x) 5 (x) d A x = /(0). 
On a manifold, however, one works with the invariant measure dfi := \f\g\d A x instead of 
d^x. In order to compensate the factor y/\g\ of the integration measure in the integral 
over the 5-distribution, we must view the combination \g(y)\~z 5 4 (x — y) as the "invariant 
^-distribution" at a point y of the manifold. Thus we are led to replace (fT§|) by 

X 1 \xa> = x % \xa> , <xa\yj3> = s a 5 a p 8 A (x — y) . (23) 



This definition of a gauge is much better than (|19|) because \xa> now behaves under 
coordinate transformations x — ► x'(x) simply like \xa> — > \x'a>. We again introduce the 
wave functions by ^f a (x) = <xa \ ^> and obtain a representation of G as the differential 
operator (|20p . Notice that this indirect definition of a gauge by first introducing (|19p 
and then replacing it by (|23|) was necessary because we had no invariant measure at 
the beginning. We must check that this procedure is consistent: The transition from 
(fT9|) to (|23|) is described by the transformation \xa> — > |g(x)|~4 \xa>. Consequently, 



with 



GP(p) = 7 J 



(21) 



g 3k (x)l = -{G>(x), G k (x)} 



(22) 
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the Dirac operator transforms as G —* \g{x)\~± G \g(x)\*, and thus G 3 — ► G 3 , B — > 
i? + jG 3 (dj log |<7[). We conclude that the definition of the Dirac operator and the Lorentz 
metric (f22j) are independent of whether (fl~9j) or ([23]) are used. 

We will in the following always work with (|23p as the definition of a gauge. The 
freedom in the choice of the gauge again describes local £7(2, 2) transformations of the 
wave functions 

* -» t/"(x)*(s) with C7 (aj) £ ?7 (2, 2) . (24) 

We define the spin scalar product by (fTUj) . According to ([23]) . the scalar product is obtained 
by integrating over the spin scalar product, 

<tf | $> = f ^ | dfj, . (25) 
J M 

The Dirac equation is again introduced as the operator equation 

(G-m)* = . (26) 

On solutions of the Dirac equation, we finally define a generalization of the scalar product 
(fT2|) : We choose a space- like hypersurface Tt with normal vector field v and set 

(tf | $) w = / | G* | i/,- d/x w , (27) 

where d/i-^ is the measure on 7i induced by the Lorentzian metric. 



5 Construction of the Spin Derivative 

We just introduced the Dirac operator on a manifold by combining the principle of general 
relativity with our £7(2, 2) gauge symmetry. Although this definition was very natural, it 
is not clear what it precisely means: The Dirac operator is characterized by the matrices 
G 3 (x) and B(x) in (I20j) . We saw that G 3 (x) induces a Lorentzian structure and that B(x) is 
composed of potentials which might be identified with usual gauge potentials. The precise 
physical interpretation of the degrees of freedom of G 3 (x),B(x) is not clear, however. 
Furthermore, we expect in analogy to the discussion of Lorentzian transformations in 
section [3] that coordinate and gauge transformations can be linked by imposing some 
conditions on the Dirac matrices. This would imply that the freedom in choosing the 
coordinate system is related to the U(2, 2) gauge symmetry. But this relation is also very 
vague at the moment. 

In order to clarify the situation, we proceed in this section with some differential 
geometry. We will try to avoid abstract formalisms and prefer calculations in explicit 
coordinate systems and gauges. This is less elegant than a coordinate-free formulation, 
but it leads to a more elementary and direct approach. Our constructions are helpful for 
the physical interpretation, as will be explained in the next section [6l In non-technical 
terms, our aim is to rewrite the Dirac operator in a form which gives a better geometrical 
understanding of the matrices G 3 \x),B(x). 

Before starting the mathematical analysis, we point out that we consider the Dirac 
operator (apart from the wave functions of the physical particles) as the only a-priori 
given object on the manifold. This means that all additional structure (like the metric, 
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curvature, classical potentials, and field tensors) must be constructed from G. It might 
seem unusual to take only the Dirac operator as the starting point of a classical gauge 
theory. The motivation for this very restrictive procedure is the following: A physical 
theory is more convincing if it is developed from few objects, which are given from the very 
beginning and are considered as the "fundamental" objects of the theory. In relativistic 
quantum mechanics, the Dirac operator is needed in any case for the formulation of the 
Dirac equation. The objects of classical field theory, however, either enter directly in the 
Dirac operator or can be expressed in terms of derivatives of potentials occurring in G. 
We conclude that it is both desirable and might be possible to consider them as derived 
objects, which are only constructed from the Dirac operator for a convenient formulation 
of the classical interactions. 

The Lorentzian metric (I22p induces the Levi-Civita connection V, which gives a parallel 
transport of vector and tensor fields along geodesies. The geometry of space-time can be 
described as usual with the Riemannian curvature tensor Kj kl , and we could thus formulate 
general relativity by writing down Einstein's field equations. This procedure only tells 
about part of the geometry, however. We did not at all use the matrix B(x) and did not 
take into account the £7(2,2) gauge symmetry. If we want to understand all the degrees 
of freedom of the Dirac operator geometrically, it is a better idea to study the parallel 
transport of spinors (instead of tensor fields). In the infinitesimal version, this parallel 
transport is given by a U(2, 2) gauge covariant derivative D on the wave functions, which 
we call spin derivative. It is characterized by the condition that the derivative Dj*f> of 
a wave function shall again behave under gauge transformations according to (|24p. The 
basic question is if the Dirac operator induces a spin derivative. 

For the construction of the spin derivative, we will analyze gauge transformations 
explicitly. In a special gauge and coordinate system, D shall have the representation 

Dj = ^j-iC J (x) (28) 

with suitable (4 x 4)-matrices Cj(x). Under a gauge transformation (|24f) . it transforms 
like 

Dj -» UDjU- 1 = dj-iUCjU- 1 + UidjU' 1 ) . (29) 
Thus our aim is to find matrices Cj(x) satisfying the transformation rule 

Cj -> UCjU- 1 + iUidjU- 1 ) . (30) 

These matrices must be formed out of G J (x),B(x), which behave under gauge transfor- 
mations like 

G j -> UG j U~ l , B -> UBU- 1 + iUG^ipjU- 1 ) . (31) 

It is far from being obvious how an explicit formula for the matrices Cj should look 
like. But we can already say something about its general structure: since only first 
derivatives of U occur in (|30p . we will only use G J ,B, and first derivatives of the Dirac 
matrices d^G 1 for the construction. As a consequence, we need not care about second 
derivatives of U; they will not enter in our calculations. Constant gauge transformations 
(i.e. transformations U with djU = 0) are also irrelevant, because they only describe a 
common unitary transformation . — > U . U^ 1 of all matrices. The point of interest is the 
first order term iU(djU~ 1 ) in (|30p . 
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We introduce some notation: In analogy to the bilinear and pseudoscalar covariants of 
the Dirac theory, we define the matrices 

a jk (x) = % - [G j , G k ] , p(x) = - e jklm G 3 G k G l G m 

In a local reference frame (|2ip . they coincide with the usual matrices o~ 3k ,j 5 in the Dirac 
representation (we use the notation p instead of 7 s , because the "tensor index" 5 might 
be confusing on a manifold). A (4 x 4)-matrix is called even or odd if it commutes resp. 
anti-commutes with p. The matrices 

G j , pG j , 1 , ip , a 3k (32) 

form a basis of the 16-dimensional (real) vector space of self-adjoint matrices (with respect 
to -<.|.>-). Gr> , pG 3 are odd; 1, ip, and a jk are even. 

Since we must expect the matrices Cj to be complicated expressions in G 3 and B, 
it is useful to first consider special gauges where these expressions should have a simple 
form. For this purpose, we study the term VkG 3 = d^G 3 + T 3 kl G l . Under coordinate 
transformations, it behaves like a tensor. Under gauge transformations, however, first 
derivatives of U occur, 

V k G j -^V k {UG 3 U- 1 ) = UiVkG^U- 1 + (dkU^U- 1 + UG^dkU' 1 ) 

= UiVkG^JJ- 1 - klidkU- 1 ), UGW- 1 ] . (33) 

We can use the second summand in (|33p for a partial gauge fixing: 

Lemma 5.1 For any space-time point p € M there is a gauge such that 

V k G 3 \p) = . (34) 

Proof: Notice that the matrix djp is odd, because 

= d 3 l = dj(pp) = (d j p)p + P (d j p) . (35) 

We start with an arbitrary gauge and construct the desired gauge with two subsequent 
gauge transformations: 

1. According to (135)) . the matrix ip{djp) is self-adjoint. We can thus perform a gauge 
transformation U with U(p) = 1, djU(p) = ^p(djp). The matrix djp(p) vanishes in 
the new gauge, since 

djp\ p -> dj{UpU- l )\ p = djp lp + ^ [p(djp), p] [p = djp\ p - p 2 {djp)\ p = 

By differentiating the relation {p, G 3 } = 0, we conclude that the matrix V^G^ is 
odd. We can thus represent it in the form 

V fc Gf p = A^ m G^ + ®{ m P G m (36) 
with suitable coefficients A^, m , 0^, m . 
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This representation can be further simplified: According to Ricci's Lemma, we have 
at p 

= 2V„^ fc = {V n G j , G k } + {&, V n G k ] 

= 2Ai m g mk - Qi m 2 ip a mk + 2A k nm g^ - Q k nm 2ipa^ , (37) 

and thus 

In the case j = k ^ m, ([37|) yields that & J nm = 0. For j ^ k, we obtain 0^ ■ a^ k + 

®nfc = an d t nus ©nj = ©nfc 0' an d ^ denote fixed indices, no summation is 
performed). We conclude that there are coefficients 0^ with 

©L = GktL . (39) 

2. We perform a gauge transformation U with J7(p) = 1 and 

d k U = ~@ k p- % -t^ n g nl a ml 

Using the representation f)36[) together with (|38p .(|39 p . the matrix V^G- 7 transforms 
like 

V fc G J -> V fe G J + [d fe C/, G J ] 

= A J fcm G m + fe pG^ - fe pG^ - % - Af n g nl [a ml , CP] 

= Km G m ~ \ Kn 9 nl 2* (G m 5j - d 51) 

= K m G m + \Kn9 n3 G m - \K{ m G m = . ■ 

In general, the condition V^G- 7 can only be satisfied "locally" in one point p € M. We 
call a gauge satisfying (f3~il) a normal gauge around p. 

Now we look to which extent the gauge is fixed by Lemma 15.11 According to (|33p . a 
transformation between normal gauges must satisfy the condition [[/(c^-t/ -1 ), UG^U" 1 ]^ = 
0. As a consequence, the matrix iU(djU )\ p must be a multiple of the identity, as is 
verified in the basis (|32p using the commutation relations between the Dirac matrices. 
Since constant gauge transformations and higher than first order derivatives of U are 
irrelevant, we can assume that U(x) itself is a multiple of the identity. In other words, we 
can restrict ourselves to U(l) gauge transformations. This is very helpful because in this 
special case there is a simple expression showing the transformation law (|30p . namely 

- Re Tr(Gj B) 1 -► - Re Tr(Gj B) 1 + - Re Tr [Gj G k iUidjlJ- 1 ) l) 

= - Re Tr(Gj B) 1 + iU(djU- 1 ) 
We can identify this expression with Cj and use (|28p as the definition for a spin derivative: 
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Def. 5.2 We define the canonical spin derivative D by the condition that it has in 
normal gauges around p the form 

In general gauges, the canonical spin derivative can be written as 

Dj = JL _ iEj - "-ReTriGjB)! (41) 

with additional matrices Ej(x). These matrices are characterized by the condition that 
Ej(p) vanishes in normal gauges around p and behaves according to ([30j) under SU (2,2) 
gauge transformations. They are given by the formula 



Ej = \ p(d jP ) - ^ Tr(G m V,G n ) G m G n + l - Tr (pGj V m G m ) p . (42) 



as can be verified by a straightforward calculation. Unfortunately, this expression is rather 
complicated. We will in the following always avoid working with Ej explicitly. 

The canonical spin derivative has some nice properties. We first collect them in a 
theorem and discuss them afterwards. 

Theorem 5.3 The canonical spin derivative satisfies the equation 

[D k ,&] + T^G 1 = . (43) 
It is compatible with the spin scalar product, 

dj ^ | = ^Dj^ | §y + | Dj<f>y . (44) 

The operator iG 3 Dj is a well-defined Hermitian operator on H . Furthermore, 

- iVj <G 3 ^ | = ^iG j Dj$> | $> | iG j Dj$y . (45) 



Proof: The left side of (I43j) behaves under gauge transformations according to the adjoint 
representation . — ► U . U^ 1 of the gauge group. Thus it suffices to check (|43p in a normal 
gauge, where 

[D k , G j ] + T J kl G l = V k G 3 - % - Re Tr(GjB) [1, G j ] = 



For the canonical spin derivative, the matrix Cj in (|28p is self-adjoint, as can be verified 
with ([41]) . ([32 ]) . This immediately implies (f4"4"|) . 

According to its behavior under coordinate and gauge transformations, we can view 
iG 3 Dj as an operator on H. Relation (|44p yields the equation 

Vj^G% | <S>y = ^DjG j V | + | Dj$y + T j jk ^G k ^ \ <S>^ 



Applying (1431) and the self-adjointness of the Dirac matrices, we obtain (J45J). If we integrate 
this equation with respect to the invariant measure d\i = ^J\g\d A x, the left side vanishes 
with Gauss' theorem. On the right side, we substitute (|25[) and conclude that the operator 



iG 3 Dj is Hermitian. 
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Generally speaking, this theorem shows that the connections V,-D, the Dirac matrices G' J , 
and the scalar products g^ % -<\>- can be consistently used in combination with each other, 
if all expressions are written in a coordinate and gauge invariant form (by appropriately 
using Vj, Dj, and PL). We can leave out the derivatives of G J and may use the product 
rule inside the spin scalar product. Relation (|44jl formally corresponds to Ricci's Lemma 
in Riemannian geometry if we replace the metric by the spin scalar product and V by D. 

Since the operators G and iG J Dj are both Hermitian and coincide up to zero order 
contributions, we can represent the Dirac operator in the form 

G = iG j Dj + H (46) 

with a self-adjoint matrix H{x). Under gauge transformations, H simply behaves accord- 
ing to the adjoint representation. This is an advantage over the formula (|20p . where the 
transformation of B involves derivatives of U. 

Equation (|45p implies a general version of current conservation: Assume that \& and 
<I> are eigenvectors of G with real eigenvalue m. Then the vector field -<G 3 ^ \ is 
divergence- free, 

-iVj^G 9 '* | *>- = <{G-H)^ | $> <*| (6? -#)$;>- = 

It is a natural generalization of the electromagnetic current ^f^^f of the Dirac theory. 
According to Gaufi' law, the current conservation implies that the scalar product (|27p is 
independent of the choice of the hypersurface TL. The integrand of the scalar product 
(^ | has the interpretation as the probability density of the particle. 

The Christoffel symbol in (|43p indicates that the spin derivative in some way includes 
the Levi-Civita connection V. The following bundle construction makes this relation more 
precise: We denote the sections of a vector bundle N over M by T(N) (e.g. T(TM) are 
the vector fields on M). The Levi-Civita connection is a connection on the tangent bundle 
TM, 

V : T P M x T(TM) -» T p M : (X, Y) -» V X Y 

We can view the wave functions as sections of a vector bundle SM with fibre (C 4 , -< . | . y ) . 
In order to describe the U (2, 2) gauge symmetry, we make this spin bundle to a prin- 
cipal bundle with structure group U(2,2), which acts on the fibre in the fundamental 
representation .—►?/.. The spin derivative is a connection on the spin bundle, 

D : T p M x T(SM) -» S p M : (X, *) -» D x ^ 

We define L p M as the vector space of self-adjoint transformations of S P M. In a special 
gauge, L p M can be represented as (4 x 4)-matrices; (|32|) gives an explicit basis. The gauge 
transformations act on the matrices according to the adjoint representation . — > U\ p . UZ . 
We denote the corresponding principal bundle by LM. The spin derivative induces a 
connection on LM by 

D : TpM xT(LM) -> L p M : (X,A)->[D X ,A] . (47) 

We can view the tangent bundle as a subspace of LM, because there are canonical em- 
beddings and projections 

l v : T p M «-» LpM : X -> X 1 G { 

Tip : LpM — > T p M : A i Tr(G^) A 
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with 7Ti = id. By restricting the connection (|47j) to the tangent bundle, we can form a 
connection V on TM, 

V : T p M x SiTM) -» T p M : (X,Y) -» vr tY] 

Equation (143D states that this connection coincides with the Levi-Civita connection, since 

[D X , tY] = Xi [Dj, G k Y k \ = Xi [(d,Y k ) G k - rj, Y k G l ) 

= X* (VjYk) G k = l V X Y (48) 

and thus 

V X Y = tt[D x , lY) = ttlV x Y = V X Y 

In this way, the Levi-Civita connection is recovered as a suitable restriction of the spin 
connection. The geometry of the spin bundle generalizes the Lorentzian geometry of the 
manifold. 

It remains to construct the geometrical invariants of the spin connection. On an 
elementary level, one can look for expressions in D k -,G 3 which transform both like a tensor 
and according to a local representation of the gauge group. This leads to the definition of 
the 2-forms 

T jk = ^{[D^Gd-^Gj]) 

1 

Rjk = 2 ^- D ^ D k] , 

which are called torsion and curvature. 

Theorem 5.4 The canonical spin connection is torsion-free. Curvature has the form 

R jk = I Rmnjk ° mn + \ (dja k - 8 kaj ) 1 (49) 
with the Riemannian curvature tensor R m njk a^d the potential 

a,j = — Re Tr(GjB) . (50) 

Proof: Relation (|4^|) yields 

[Dj, G k ] = [Dj, g k [G l ] = (djg k i)G l - g k iT l j m G m = T™ k G m 
and thus, using that the Levi-Civita connection is torsion-free, 

We iterate formula (|48j) and express the Riemannian curvature tensor in terms of the 
curvature of the spin connection, 

G t R) kl Z l = t{VjV k Z-V k VjZ) 

= [Dj, [D k , lZ}} - [D k , [Dj, lZ}\ 
= [[Dj, D k ], tZ] = -2i [R jk , lZ] 
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This equation determines curvature up to a multiple of the identity matrix, 

Rjk(x) = -Rmnjk(T mn + Ajfcl 
o 

Thus it remains to calculate the trace of curvature, 

- Tt(R jk ) = - Tr(djC k - d k Cj) = ^ (dja k - d k aj) 

where we used (|4ip and the fact that the matrices Ej are trace-free. ■ 

We remark that the canonical spin derivative is not the only spin derivative which can 
be constructed from the Dirac operator. A general spin derivative D differs from the 
canonical spin derivative by matrices Fj = Dj — Dj which transform according to the 
adjoint representation of the gauge group. It is no loss of generality to work with the 
canonical spin derivative because the matrices Fj can also be dealt as additional potentials. 
Because of the useful properties of Theorem 15.41 it is most convenient to use the canonical 
spin derivative. 

6 Interpretation, the Physical Dirac Operator 

For the physical interpretation of the previous constructions, we return to the discussion 
of the moving observer on page We saw that the coordinate system and gauge sat- 
isfying (|2ip give a local reference frame for the observer. The gravitational field can be 
compensated locally by choosing a normal coordinate system. According to Lemma 15.14 
we can further compensate the dynamics of the Dirac matrices: in a normal gauge even 
the derivatives of the Dirac matrices vanish, d k G 3 {p) = V k G 3 {p) = 0. This means that 
the above mentioned "spin mixing force" can also be compensated locally by choosing an 
appropriate coordinate system and gauge. We cannot do even better because second order 
derivatives of the metric and the Dirac matrices involve the Riemannian curvature tensor. 
We call a coordinate system and gauge satisfying the conditions G^(p) = 7 J , d k G^(p) = 
a normal reference frame around p. 

We come to the discussion of the matrix B{x), which was not at all specified in Def. 
14.11 The strongest local condition is to impose that B(p) vanishes in a suitable normal 
reference frame around p. This condition turns out to be of physical interest, and we use 
it as the definition for the physical Dirac operator: 

Def. 6.1 A Dirac operator G is called physical Dirac operator if for any p € M there 
is a normal reference frame around p such that B{p) = 0. 

In other words, the physical Dirac operator is characterized by the condition that it lo- 
cally coincides with the free Dirac operator, which means more precisely that there is a 
coordinate system and a gauge with G 3 {p) = 7 J , d k G 3 (p) = and B(p) = 0. 

We again consider the physical Dirac operator as the only a-priori given object on the 
manifold and construct the metric and the canonical spin derivative from G. In a normal 
reference frame with G = iG 3 dj, the canonical spin derivative (I40p coincides with the 
partial derivatives, Dj = dj. Thus the physical Dirac operator has the representation 

G = iG j Dj , (51) 
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which is also valid in a general gauge and coordinate system. In comparison to the rep- 
resentation (|20p . the degrees of freedom of the matrix B(x) now occur in the canonical 
spin derivative. This has the advantage that they can be immediately identified with the 
£7(2,2) gauge potentials in (|4ip . The last summand in (|4ip describes the £7(1) phase 
transformations of electrodynamics. We define the electromagnetic potential by 

Aj = — ReTr(GjB) 

and rewrite the canonical spin derivative in the more familiar form 

d 

Dj = — — r — iEj — ieAj 
dx 3 3 

The matrices Ej take into account the gravitational field. According to Theorem 15.41 the 
curvature of the spin connection is formed of the Riemannian curvature tensor and the 
electromagnetic field tensor Fjk = djA^ — d^Aj. We can write down the classical action in 
terms of these tensor fields. The classical variational principle yields a unified description 
of general relativity and electromagnetism as a £7(2, 2) gauge symmetry. 

The definition of the physical Dirac operator can also be understood in more tech- 
nical terms with the canonical spin derivative: The Dirac operator of Def. 14.11 has the 
representation (|46p with a self- adjoint matrix H(x). According to the construction of the 
spin derivative, the matrices G 3 H are trace- free. Thus H(x) is characterized by twelve 
degrees of freedom, which correspond to additional potentials (more precisely, 1 scalar, 1 
pseudoscalar, 4 pseudovector, and 6 bilinear potentials). These potentials are not gauge 
potentials. They do not seem to occur in nature. Therefore it is a reasonable physical 
condition to assume that H vanishes. This leads to the representation (I5ip of the physical 
Dirac operator. 

Our definition of the Dirac operator introduces the gauge potentials and in this way 
replaces the usual minimal coupling procedure. We explain for clarity why simple minimal 
coupling does not make sense in the context of our £7(2, 2) symmetry: Assume that we 
had (instead of constructing the spin derivative from the Dirac operator) introduced a 
£7(2, 2) gauge covariant derivative D of the form ([28]) with gauge potentials Cj. According 
to minimal coupling, we must replace the partial derivatives in the free Dirac operator by 
gauge covariant derivatives, which gives the operator i^ 3 Dj. This operator does not make 
sense, however. First of all, it is not Hermitian (with respect to <.|.>). Furthermore, the 
current -<7- ? \& | tyy is in general not divergence- free, so that the probabilistic interpretation 
of the Dirac wave function breaks down. The basic reason why minimal coupling does 
not work is that the £7(2,2) gauge potentials Cj do in general not commute with the 
Dirac matrices j 3 . In order to bypass these problems, we must replace the 7-matrices 
by dynamical Dirac matrices G 3 {x). The two replacements dj — > Dj and -y 3 — ► G 3 must 
be coordinated in such a way that the Dirac operator iG 3 Dj is Hermitian and allows a 
reasonable definition of a conserved current. This is accomplished by our definition of the 
Dirac operator and the subsequent construction of the spin derivative. 

Our description of gravitation differs considerably from the usual formulation of general 
relativity as a gauge theory (see e.g. [5]). As one of the major differences, we avoid 
the principal bundle of orthonormal frames (with the Lorentz group as structure group); 
instead we are working with the spin bundle and local £7(2,2) transformations. This 
replacement is possible because the subgroup of £7(2, 2) generated by the bilinear covariants 
a 13 is locally isomorphic to the Lorentz group. In contrast to the connection on the bundle 
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of orthonormal frames, the canonical spin connection is torsion-free. Remember that the 
£7(2,2) spin bundle arose as a consequence of our measurement principle for space and 
time. Thus, as an advantage of our procedure, the usual bundle constructions for the 
definition of gauge fields and of spinors in curved space-time are no longer necessary. 



7 Conclusion 

We saw that the adaptation of the ideas in [3] to the relativistic context yields a local 
U(2, 2) gauge symmetry of the Dirac equation. In order to describe the physical inter- 
actions with this gauge symmetry, it is necessary to consider the Dirac operator as the 
basic object on the manifold. The gauge potentials are implicitly contained in the Dirac 
operator. By constructing the spin derivative, they are recovered as describing the electro- 
magnetic and gravitational field. In this way, we conclude that the local U{2, 2) symmetry 
in relativistic quantum mechanics makes physical sense. The concept of measurability of 
space-time gives a fundamental explanation for this gauge symmetry. 

Our description has the advantage that both the Dirac theory and classical field the- 
ory are developed from few a-priori given objects: The fermionic particles correspond to 
vectors of the indefinite scalar product space (H, <.|.>). Space-time is described by the 
spectral measure dE x on the manifold M. The Dirac operator gives the gauge potentials 
and determines the interaction between the fermions and the gauge fields. This description 
is conceptually simple. It is the starting point for further constructions which finally lead 
to the "Principle of the Fermionic Projector" as introduced in 
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